User Modeling of Parallel Workloads
نویسندگان
چکیده
The goal of workload modeling is to simulate the expected workload, accurately enough to enable making correct design and administrative decisions. Several statistical features of production parallel computer workloads, which are not embodied in current models, have been identified. Their practical importance is demonstrated by two new kinds of schedulers – a key component in determining the overall performance of a parallel computer. The first is adaptive scheduling, which takes advantages of the locality of sampling and known cycles in parallel workloads, and achieves an average improvement of 10% in performance and 35% in stability for the tested production workloads. The second is shortest-jobbackfill-first scheduling, which relies on runtime prediction, done by analyzing user and session histories. These schedulers cannot be correctly evaluated by existing workload models, and we argue that the correct approach for future workload models (as well as on-line algorithms) is user and session-based modeling, instead of modeling jobs directly as done today. As the basis for such a model, we use PCA to provide variable sets which explain over 80% of the variance between users and sessions, and clustering to identify five stable session clusters and four stable user clusters. We then model the distributions of the arrival and activity patterns of both users and sessions, including a complete analysis of their dependencies and temporal structure. The model is based on logs from seven different parallel supercomputers, spanning over 87 months, analyzed together to ensure that results are location and architecture-neutral.
منابع مشابه
Workload modeling for parallel computers
The availability of good workload models is essential for the design and analysis of parallel computer systems. A workload model can be applied directly in an experimental or simulation environment to verify new scheduling policies or strategies. Moreover, it can be used for extrapolating and predicting future workload conditions. In this work, we focus on the workload modeling for parallel com...
متن کاملAnalysis and Modeling of Social Influence in High Performance Computing Workloads
Analysis and Modeling of Social Influence in High Performance Computing Workloads Shuai Zheng High Performance Computing (HPC) is becoming a common tool in many research areas. Social influence (e.g., project collaboration) among increasing users of HPC systems creates bursty behavior in underlying workloads. This bursty behavior is increasingly common with the advent of grid computing and clou...
متن کاملNursing Workloads and Psychological Empowerment in Hospitals: Structural Equations Modeling
Background: The high workload of nurses in hospitals has been identified as a patient safety and worker stress problem. Psychological empowerment is a motivational concept demonstrated in four dimensions: meaning, competence, self-determination, and impact. Objective: This study investigated the relationship between nurses’ workloads and psychological emp...
متن کاملModeling Parallel System Workloads with Temporal Locality
In parallel systems, similar jobs tend to arrive within bursty periods. This fact leads to the existence of the locality phenomenon, a persistent similarity between nearby jobs, in real parallel computer workloads. This important phenomenon deserves to be taken into account and used as a characteristic of any workload model. Regrettably, this property has received little if any attention of res...
متن کاملUtilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling
ÐScheduling jobs on the IBM SP2 system and many other distributed-memory MPPs is usually done by giving each job a partition of the machine for its exclusive use. Allocating such partitions in the order in which the jobs arrive (FCFS scheduling) is fair and predictable, but suffers from severe fragmentation, leading to low utilization. This situation led to the development of the EASY scheduler...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006